Syllable Detection and Segmentation Using Temporal Flowneural Networks

نویسندگان

  • Lokendra Shastri
  • Shuangyu Chang
  • Steven Greenberg
چکیده

The syllable serves as an important interface between the lowerlevel (phonetic and phonological) and the higher-level (morphological and lexical) representational tiers of language. It has been demonstrated that reliable segmentation of spontaneous speech into syllabic entities is useful for speech recognition. An automatic method is described for delineating the temporal boundaries of syllabic units in continuous speech using a Temporal Flow Model (TFM) and modulation-filtered spectral features. The TFM is a neural network architecture that supports arbitrary connectivity across layers, provides for feed-forward as well as recurrent links, and allows variable propagation delays along links. Two TFM configurations, global and tonotopic, have been developed and trained on a phonetically transcribed corpus of telephone and address numbers spoken over the telephone by several hundred individuals of variable dialect, age and gender. The networks reliably detected the boundaries of syllabic entities with an accuracy of ca. 84%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

متن کامل

A multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images

The reconstruction of the information contaminated by cloud and cloud shadow is an important step in pre-processing of high-resolution satellite images. The cloud and cloud shadow automatic segmentation could be the first step in the process of reconstructing the information contaminated by cloud and cloud shadow. This stage is a remarkable challenge due to the relatively inefficient performanc...

متن کامل

Score-Informed Syllable Segmentation for A Cappella Singing Voice with Convolutional Neural Networks

This paper introduces a new score-informed method for the segmentation of jingju a cappella singing phrase into syllables. The proposed method estimates the most likely sequence of syllable boundaries given the estimated syllable onset detection function (ODF) and its score. Throughout the paper, we first examine the jingju syllables structure and propose a definition of the term “syllable onse...

متن کامل

Relative Functional Comparison of Neural and Non- Neural Approaches for Syllable Segmentation in Devnagari TTS System. Prof Mrs

This paper presents methods for automatic speech signal segmentation using neural network. Speech signal segmentation is carried out to form syllables. Syllable is a common unit for concatenative TTS systems. Concatenative TTS being using speech segments of recorded speech is natural as compare to Formant or Articulatory TTS systems. This TTS stores small segments of speech and join them togeth...

متن کامل

کاهش رنگ تصاویر با شبکه‌های عصبی خودسامانده چندمرحله‌ای و ویژگی‌های افزونه

Reducing the number of colors in an image while preserving its quality, is of importance in many applications such as image analysis and compression. It also decreases memory and transmission bandwidth requirements. Moreover, classification of image colors is applicable in image segmentation and object detection and separation, as well as producing pseudo-color images. In this paper, the Kohene...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999